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Abstract 

We give improved pseudorandom generators (PRGs) for Lipschitz functions of low- 
degree polynomials over the hypercube. These are functions of the form i}/{P{x)), where 
P : {1,-1}" — > R is a low-degree polynomial and \p : R — > R is a function with small Lips- 
chitz constant. PRGs for smooth functions of low-degree polynomials have received a lot of 
attention recently and play an important role in constructing PRGs for the natural class of poly- 
nomial threshold functions. In spite of the recent progress, no nontrivial PRGs were known for 
fooling Lipschitz functions of degree 0(log n) polynomials even for constant error rate. In this 
work, we give the first such generator obtaining a seed-length of (\ogn)0{€ 2 1 s 2 ) for fooling 
degree I polynomials with error e. Previous generators had an exponential dependence on the 
degree I. 

We use our PRG to get better integrality gap instances for sparsest cut, a fundamental 
problem in graph theory with many applications in graph optimization. We give an instance of 
uniform sparsest cut for which a powerful semi-definite relaxation (SDP) first introduced by 
Goemans and Linial and studied in the seminal work of Arora, Rao and Vazirani [ARV] has 
an integrality gap of exp(Q((loglogn) 1 ^ 2 )). Understanding the performance of the Goemans- 
Linial SDP for uniform sparsest cut is an important open problem in approximation algorithms 
and metric embeddings. Our work gives a near-exponential improvement over previous lower 
bounds which achieved a gap of Q(loglogn) [DKSV, KR]. Our gap instance builds on the 
recent short code gadgets of Barak et al. [BGH + ]. 



1 Introduction 

We study the natural question of constructing pseudorandom generators (PRGs) for functions of 
low-degree polynomials over the hypercube. For instance, an important class of such functions 
is polynomial threshold functions (PTFs). These are functions of the form / : {1,-1}" — » {1,-1} 
given as f(x) = sign(P(x)) where P is a real- valued multivariate polynomial. PTFs are an important 
class of functions with a variety of applications in complexity theory [Bei], learning theory [KS2], 
voting theory [ABFR] and more. Recently, a lot of attention has been given to the problem of 
constructing PRGs for PTFs - [DGT, RSI, DKN, MZ, Kanl, Kan2]. At the core of most of these 
results is a PRG that fools smooth functions of polynomials. For a function iff : R —* R, let 
\\iff\\u P = sup^ \if/(x) - ifr(y)\/\x - y\. 
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Definition 1.1. We say a generator G : {0, \ } r — » {1,-1}" s-fools Lipschitz functions of degree I 
polynomials if for every function if/ : R — > R anJ degree I polynomial P with \\P\\ = 1 

E mP(G(y)))]- E WPW)] 



< £ ■ UVWLip- 



Non-explicitly there exist generators as above with seed-length r = 0(\ogn + £\og(\/s)). In 
this work, we give the best known explicit PRG for Lipschitz functions of polynomials: 

Theorem 1.2. There exists an explicit generator G : {0, l} r — > {1,-1}" that fools Lipschitz func- 
tions of degree £ polynomials with error s and has seed-length r = 0((\ogn + \og 2 (i/s)) • £ 2 /s 2 ). 

In contrast, the best previous constructions, [MZ], had at least an exponential dependence 
on the degree i in the seed-length. In particular, no non-trivial PRGs were known for fooling 
Lipschitz functions of degree 0(log n) polynomials, and as a result for degree 0(log n) PTFs, even 
for constant error rate. 

As mentioned above, at a high-level, most of the previous PRGs for PTFs worked by first 
constructing a PRG for fooling smooth functions as above (with additional assumptions on the 
polynomials P) and then approximating the sign function suitably by smooth functions. Thus, our 
generator can be seen as a significant step towards obtaining nontrivial PRGs for PTFs of degree 
O(logn). 

Besides the natural interest in constructing PRGs as in Theorem 1 .2 in the context of fooling 
PTFs, an additional motivation for the question (which drew us to the problem in the first place) is 
the following application to hardness of approximating uniform sparsest cut. 



1.1 Integrality Gaps for Uniform Sparsest Cut 

Uniform sparsest cut is a fundamental problem in graph theory with a variety of applications in 
graph optimization and often appears as a basic step in several important approximation algorithms 
(for e.g., see the survey by Shmoys [Shm]). 

Given an undirected graph G = (V, E), the goal in the uniform sparsest cut problem is to find 
the subset of vertices S Q V that minimizes \E(S, S)\/\S \\S |, where E(S,S) denotes the number of 
edges between S and S . 

The uniform sparsest cut problem and its generalization-non- uniform sparsest cut 2 , have been 
extensively studied in the context of approximation algorithms. The exact problem is known to 
be NP-hard [SM], but no NP-hardness results are known for approximating the optimum. On the 
other hand, constant-factor hardness of approximation results are known for non-uniform sparsest 
cut version assuming the unique games conjecture [CKK + , KV]. 

Designing approximation algorithms for sparsest cut has received a lot of attention in algorithm 
design, culminating in the seminal work of Arora, Rao and Vazirani [ARV] who gave a 0( -^log n) 
factor approximation for uniform sparsest cut. This was later extended to non-uniform sparsest cut 
by Arora, Lee and Naor [ALN]. 

Arora, Rao and Vazirani in fact showed that an SDP relaxation for the problem first proposed 
by Goemans and Linial in the late 1990s, [Goe, Lin], has an integrality gap of 0( yjlog n) for 

'Throughout, for a multi-set S, x ~ S denotes the uniform distribution over S and ||P|| = E A -~|i -iy[P(x) 2 ] 1 ^ 2 . 
2 Here, one allows demands on the edges; we do not define the problem formally as we do not use it later on. 
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uniform sparsest cut. Since then, understanding the performance of the Goemans-Linial SDP (so- 
called "basic SDP with triangle constraints") has been of much interest. A significant step in 
this direction was taken by the breakthrough work of Khot and Vishnoi [KV] who showed that the 
SDP relaxation for non-uniform sparsest cut has an integrality gap of at least (log log n) 1/6_o(1) , thus 
disproving a conjecture of Goemans and Linial. Following this, Devanur et al. [DKSV] showed that 
the Goemans-Linial SDP has a gap of Q.(log log n) even for uniform sparsest cut. Finally, Cheeger, 
Kleiner and Naor [CKN] showed an integrality gap of (log n) n(1) for non-uniform sparsest cut. 

In this work we obtain a near-exponential improvement in the integrality gap of the Goemans- 
Linial SDP for uniform sparsest cut. (We refer to Section 3 for the formal definition of the SDP). 

Theorem 1.3. The Goemans-Linial SDP relaxation for uniform sparsest cut has an integrality gap 
of at least exp (Q( -^log log n) ). 

We remark that the only better gap instance in this vein of Cheeger, Kleiner and Naor for 
non-uniform sparsest cut is known to not work for uniform sparsest cut. This difference between 
uniform sparsest cut and non-uniform sparsest cut is particularly interesting in the context of metric 
embedding, which we explain below. 



1.2 Embedding Negative-type Metrics into t\ 

There is a strong connection between embedding metrics into l\ and the sparsest cut problem as 
was first evidenced in the work of Linial, London and Rabinovich [LLR] . By now it is well under- 
stood (for e.g., see [Rab], [CKN]) that the integrality gap of the Goemans-Linial SDP correspond 
exactly to the distortion of embedding negative-type metrics 3 into l\. In particular, integrality gaps 
for the Goemans-Linial SDP for non-uniform sparsest cut correspond to worst-case distortion of 
such embeddings and integrality gaps for uniform sparsest cut correspond to average-distortion of 
such embeddings. Cheeger, Klein and Naor [CKN] show that the worst-case distortion can be as 
large as (logn) n(1) . However, as mentioned in their work, their metric does in fact have a small 
average-distortion embedding into l\ . Our gap instance immediately gives the following corollary. 

Corollary 1.4. There exists a negative-type metric on n points that requires an average distortion 
of at least exp (Q( -y/log log/i)) to embed into l\. 



1.3 Hierarchy Lower bounds for Sparsest Cut 

Finally, we remark that our improved gap instance also translates to improvements in hierarchy 
lower bounds for uniform sparsest cut. SDP hierarchies are one of the most powerful techniques 
in algorithm design and knowing their limitations for specific problems often gives strong uncon- 
ditional evidence for the hardness of the problem at hand (see [CT] for a recent survey). This 
is even more compelling for problems where we do not have NP-hardness results as is the case 
for uniform sparsest cut. One such important class of hierarchies is the Sherali-Adams hierarchy, 
which we augment here by starting with the basic SDP relaxation. Indeed, the Goemans-Linial 
SDP relaxation for sparsest cut is contained within a constant number of levels of this hierarchy. 

3 A metric space (X, d) is said to be of negative type if it embeds isometrically into f%: there exist f : X — > & such 
\h a .td( X ,y) = \\f(x)-f(y)\\ 2 r 
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We defer a more precise description of the hierarchy to Section 3. Our lower bound for uniform 
sparsest cut in this hierarchy is as follows. 



Theorem 1.5. The integrality gap after R rounds of the Sherali- Adams hierarchy for uniform spars- 
est cut starting with the basic SDP relaxation is at least exp (0( ->/log log nfj /R. 

The above result gives co(\) integrality gap for R = exp(Q( ^log log n) ) rounds. In contrast, 
the best previous results (even for the harder non-uniform sparsest cut problem) of Khot and Saket 
[KS1] and Raghavendra and Steurer [RS2] had non-trivial guarantees for at most (log log 
rounds. 



2 Proof Outline 

2.1 PRGs for Lipschitz Functions of Polynomials 

The basic generator we use is similar to that of Rabani and Shpilka [RS 1 ] and Meka and Zuckerman 
[MZ]. However, our analysis is quite different and is arguably simpler. In particular, we do not 
need to appeal to the invariance principle for low-degree polynomials of Mossel, O'Donnel and 
Oleszkiewicz [MOO]. 

The generator we consider is the following. Fix n,t,m = n/t and let = {h : [n] — > [t]} be a 
family of hash functions. Let 8 > be a parameter to be chosen later and let G/, : {0, l} n — > {l,-l} m 
be a PRG that fools halfspaces with error at most 8. Let G<h,g h : {0, l} r — » {1, -1}" be the generator 
which samples a hash function h ~ H, partitions the coordinates into buckets according to h and 
uses an independent sample generated according to Gh to set the coordinates in each bucket. We 
remark that the generator is similar to those in Rabani and Shpilka and Meka and Zuckerman with 
a different choice of Gh- Let F : {0, 1}* — » {l,-l} m be a 2^-wise independent family chosen 
independently of G<H@ h . We define a new generator G<h,g h ,f by taking the component- wise product 
of G<h g h an d F. 

We show that G<n,G h ,F fools Lipschitz functions of degree t polynomials with error s when 
there are t = 0(£ 2 /s 2 ) blocks and the inner half space generator Gh fools halfspaces with error 
8 = 0(s 2 /£). The analysis of the generator proceeds as follows. 

Let \p be a Lipschitz function and P a degree i polynomial. Call a monomial in P bad if the 
random hash h ~ *H assigns more than one of the coordinates in the monomial to a single bucket. 
Delete all such bad monomials to get a polynomial Ph- We use pairwise independence of the hash 
family H to argue that any fixed monomial is bad with small probability. In particular, the total 
weight of deleted monomials in P h will be small in expectation. Finally, we use a simple tail bound 
to argue that as iff is Lipschitz, the error due to going from il/{P{ )) to iff(Ph( )) will be small. 

Finally, note that P h is linear within the coordinates in each bucket and we know that G/, fools 
halfspaces with sufficiently small error. Combining this with a hybrid argument across the t buckets 
shows that for any fixed hash function h, the generator G^,G/„f fools if/{Ph{ )) with error at most 
8 ■ t. By setting the parameters appropriately and averaging over the choice of the hash functions 
h ~ H gives us the desired result. Theorem 1.2 then follows from using the PRG for halfspaces of 
Meka and Zuckerman [MZ] as Gh in G<h,g,„f- 
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2.2 Integrality Gaps for Uniform Sparsest Cut 



We next describe our results on uniform sparsest cut. The starting point for our improved gap 
instances is the short code graph (aka Reed-Muller graph) of Barak et al. [BGH + ], who gave an 
exponentially smaller gadget (the 'short code') which can be used in place of long code in certain 
hardness reductions. At a high-level, our gap instance for uniform sparsest cut is obtained by 
replacing the long code with the short code in the construction of Devanur et al. [DKSV]. We 
analyze our gap instance using the framework of Raghavendra and Steurer [RS2]. 

The gap instance of Devanur et al. is obtained by looking at the hypercube (viewed as a long 
code) and folding the graph along cyclic shifts. That is, by collapsing sets of vertices which are 
cyclic shifts of one another to a single vertex. Our graph on the other hand is obtained by looking 
at the short code graph of Barak et al. whose vertices correspond to elements of the Reed-Muller 
code. 

Unfortunately, the short code graph unlike the hypercube is not invariant under cyclic shifts. We 
get around this hurdle by observing that for the high-level intuition behind the folding operation of 
Devanur et al. to work, one only needs a group of automorphisms on the graph that are transitive 
on the dictator functions corresponding to the vertices. For Devanur et al., cyclic shifts satisfy this 
requirement. For the Reed-Muller graph of Barak et al. one such set of group actions is provided by 
affine shifts, where a F 2 polynomial P : F!J — > F 2 is mapped to the polynomial P a (x) = P(x+a), for 
a e K[. Thus, our final gap instance for uniform sparsest cut is obtained by folding (i.e., collapsing 
the vertices of) the short code graph of Barak et al. along the orbits of the above affine shift action. 

The approach of replacing the long code with short code to obtain stronger integrality gap in- 
stances is already present in [BGH + ] and in fact lead to significantly better integrality gap instances 
for unique games and various related constraint satisfaction problems. Unfortunately, making it 
work for the case of uniform sparsest cut (or even non-uniform sparsest cut) presents significant 
technical challenges and lead to no substantial quantitative improvements. In particular, a major 
bottleneck in the work of Barak et al. was the error parameter in a derandomized majority is sta- 
blest result over the Reed-Muller code shown by the authors. Using our improved PRG, we are 
able to show a much stronger (in fact exponentially better) majority is stablest result over the Reed- 
Muller code. This quantitative improvement provides the basis for the analysis of our integrality 
gap instances. 

We remark that even after obtaining the better PRG as in Theorem 1.2 it is technically chal- 
lenging to make sure all the pieces of [DKSV], [BGH + ] and [RS2] fit together to give our final 
gap instance for uniform sparsest cut. In particular, we have to redo several technical claims from 
[BGH + ] and [RS2] adapted to our specific context. We defer further details to the appropriate 
sections. 

3 Preliminaries 

3.1 Pseudorandomness 

We use the following standard notions from pseudorandomness. 

Definition 3.1. A distribution 2) on {1,-1}" is k-wise independent if for every I c [n] with \I\ < k, 
and x ~ D, the random variables ((jc,-)j£/) are independent and uniform over {1,-1}. 
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Definition 3.2. A family of hash functions H = {h : [n] [t]} is pairwise independent if for all 
i j 6 [n] the random variables h(i), h(j), h ~ H are independent and uniform over [t]. 

There explicit distributions D and hash families *H as above that can be sampled with 0(k log n) 
[AS] and 0(\ogn) [CW] random bits respectively. 

We say a distribution T> e-fools a class of functions T = {f : {1,-1}" — > {1,-1}} if for 
every f e T, Pjt-oL/W = 1] = Phi ,-u"[/W = 1] ± £■ We shall use the following results of 
Diakonikolas et al. [DGJ + ] and Meka and Zuckerman [MZ] about fooling halfspaces (these are 
functions of the form f(x) = sign««;, x) - 8) for w e R" and 6eR). 

Theorem 3.3 ([DGJ + ]). There exists a constant c such that for every s > and k > c log 2 (l/e)/e 2 , 
k-wise independent distributions fool halfspaces with error at most e. 

Theorem 3.4 ([MZ]). There exists an explicit PRG G : {0, l} r — » {1,-1}" that fools e-fools halfs- 
paces with a seed-length of r = 0(logn + log 2 (l/e)). 

We also use the following simple large-deviation bound for variables with limited independence 
(see appendix for proof). 

Lemma 3.5. Let Xi, . . . , X N e { 1 , -1} be k-wise independent for k even. Then, for all t > 0, 
P[IZi*;l > tyfN] ^k k ' 2 /t k . 

3.2 The Short Code 

We next review the short code graph or Reed-Muller graph of Barak et al. [BGH + ]. Throughout, 
the Reed-Muller code will correspond to the Reed-Muller code over F 2 . 

Definition 3.6. The n-variate Reed-Muller code of degree d, denoted RM(n, d) is a length 2" linear 
code with messages corresponding to n-variate polynomials P from F 1 ^ — > F 2 of degree at most d. 
The encoding of a polynomial P is (P(x)) xe p 2 . 

We shall often, without explicitly stating so, view RM(n,d) as a subset of {1,-1} 2 " via the 
mapping a e {0, 1} <-» (-1)" e {1, -1}. The meaning will be clear from the context. 
We shall repeatedly use the following standard facts of Reed-Muller codes. 

Lemma 3.7. The dual o/RM(n, d) is RM(n, n - d - 1) and the uniform distribution over RM(n, d) 
is 2 d -wise independent. 

We next abstract the main properties of the short code graph from [BGH + ]. For fixed values of 
n,d,N = 2" and a e F^, letfa : —> {1, -1} be the character defined by Xa(x) = (-l)< a ' A> and 
define 

deg(Xa) = rnin{ wt(a + y) : y e RM(n, n - d - 1) }. 

Barak et al. give a (weighted) Cayley graph whose vertex set is a Reed-Muller code and more 
importantly, whose spectrum closely approximates the spectrum of the Boolean noisy cube. 

Lemma 3.8 (See [BGH+]). For all s e (0, 1/8), n, d > 0, p = e' £ , N = 2", the following holds. 
There exists a weighted Cayley graph G = G(n, d, s) with vertex set V = RM(n, d) such that: 
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- The eigenvectors ofG are the characters \x a '■ a e Rj /RM(n, n — d - l)} 4 . Let A a denote the 
eigenvalue of character x a 

- The graph G is affine-shift invariant in the following sense. For any b e RJ, let 7T& : 
RM(n, d) -> RM(n, d) be defined by n h (P) = P' where for P e RM(n, d), P' is the poly- 
nomial with P'{x) = P(x + b). Then, is an automorphism on G. 

- For any a, if deg(x a ) = k, A a < max{p k/2 ,ff° 2d ) where po is an absolute constant. 

- For all S < 6q for some constant 60, if deg(x a ) = k < S 2 2 d+l , then \A a — p | < 6. 

- For any vertex u e RM(n, d) and v a random neighbor ofu in G, E[(w, v)] > (1 - s)N and for 
any two adjacent u,v e RM(n, d), (u, v) > 3N/4 5 . 

3.3 Uniform Sparsest Cut 

We next review some basics about the uniform sparsest cut problem along with the basic SDP 
relaxations. We will focus on the balanced separator problem; It is well known that integrality gap 
instances for balanced separator translate to similar gaps for uniform sparsest cut (see [DKSV] for 
instance). 

Throughout this work we shall view graphs as given by a normalized adjacency matrix and 
will often view them as specified by (and specifying) a random walk on the set of vertices. Given 
a graph G and a subset S of vertices, let the conductance of 5, ficiS), be the probability that a 
random edge out of a randomly chosen vertex of S lands outside of S . Let p G denote the stationary 
distribution of G. 

Definition 3.9 (Balanced Separator Problem). Given an undirected graph G and a parameter b € 
(0,1/2), find 

<P(G,b) = min{0 G (5) : p G (S) 6 [b, 1 - b]}. 

For our purposes it suffices to study the question where b is any fixed constant, say b = 1/3. 

We next describe the standard semi-definite relaxation of balanced separator problem with sev- 
eral rounds of the Sherali-Adams hierarchy. For a distribution p on R m and A c [m], let margin^^) 
denote the marginal distribution of p on the coordinates in A. Finally, for two distributions p, v, let 
\\p - v||i denote the statistical distance between them. The SDP relaxation we consider is given in 
Figure 3.3. Intuitively, the relaxation can be seen as starting with the "standard" SDP for balanced 
separator and placing all possible local integrality constraints on any set of size at most R. We refer 
to [RS2] for more detailed motivation for the hierarchy. For our purposes, it mainly suffices to say 
that the Goemans-Linial SDP considered by Arora, Rao and Vazirani [ARV] lies within a constant 
number of levels of SA R as described in Figure 3.3. 

Given a collection of vectors (vj : i e \V(G)\) and integral distributions {p s : S Q [n], \S \ < R) 
we call the pair SA R -feasible for balanced separator if together they satisfy the last four constraints 
of the SDP in Figure 3.3. 

4 This follows as G is a Cayley graph and we have to quotient out the dual of RM(n, d) which is RM(n, n — d—V). 

5 Strictly speaking, the graph of Barak et al. has edges {u, u} with (u, v) < 3N/4, albeit with exponentially small 
weights. We enforce this condition as it can be done without any change to the theorem as stated and it avoids some 
annoying technicalities. 
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SA^-Hierarchy. Input: Graph G, b e (0, 1/2), R - number of rounds. Variables of the SDP are in 
bold and p G denotes the stationary distribution on G. 

1 , 
minimize E - \\v t - vM 

(i,j)~E(G) 4 

1 9 

subject to E - \\vi - Vj\\ 2 > 2b(l - b) 

(i,j)~PC 4 

{v u vj) = E [xiXj], S c V(G), \S\<R, i, jeS, 

x~ns 

(v t , v ) = E [ Xi \, S c V(G), \S\<R, ieS, 

||margin AnB (/x A ) - margin Anfl (/x B )lli =0, A, Be V(G), \A\, \B\ < R, 

Us a distribution on { 1 , - 1 } s S c V(G), |5 1 < R. 



Figure 1 : SDP relaxation of balanced separator in Sherali-Adams hierarchy 

We shall use the results of Raghavendra and Steurer [RS2] and Knot and Saket [KS1] that show 
how to lift gap instances for the basic SDP to higher rounds of the Sherali-Adams hierarchy. We 
will mainly follow the framework of Raghavendra and Steurer. Below we state one of their results 
with a view towards our application to the balanced separator problem. 

Definition 3.10 ([RS2]). A nice system of clouds is a collection S of subsets ofH d with the follow- 
ing properties: 

- Every set B £ S consists ofN unit vectors. The sets B 6 S are referred to as clouds. 

- Near Orthogonality: For every B e < B, and every unit vector u 6 R rf , 2„ eB (w, v) 2 < 3/2. 

- Matching Property: For every pair of clouds (A,B) e IB, there exists a matching M : A — > B 
such that for every u e A, M(u) = argmax^ \{u, v)\. 

- Integrality: All vectors in S are elements of [A, -A} d for some fixed A. 

The following result of Raghavendra and Steurer says that given a nice system of clouds one 
can get feasible solutions for several rounds of the Sherali-Adams hierarchy whose "geometry" 
(especially mutual correlations) is close to that of the clouds. 

Theorem 3.11 ([RS2]). ForR, t > 0, let 6 = 10 • R 2 ■ e~ t/l6R . Then, for every nice system of clouds 
S over R rf of size N each, there exists a SA R -feasible pair of vectors {v B : B e S) and distributions 
(Us over {1, -\} s : S c S) such that the following holds. The vectors (v B : B e S) are of the form 



= Vl - 6 ■ normal 



1 1 N 
— — > u 



V Viv ueB ) 

where 6 (ni : B e !B) is a set of unit vectors orthogonal to the vectors {u®* : u 6 U5}. 



6 For a vector w e R" ! , normal(u;) denotes the unit vector in the direction of w. 
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4 PRGs for Lipschitz Functions of Polynomials 



We now give a PRG for Lipschitz functions of low-degree polynomials, proving Theorem 1.2. 
To avoid some minor technicalities, throughout we shall assume that the Lipschitz functions we 
consider are smooth in which case \\if/\\up = sup v \if/'(x)\. 

Recall the hashing generator, G<n,G h ,F, defined in the introduction, Section 2.1. We show that 
for suitable setting of t, 6, G<h,g ; , fools Lipschitz functions of polynomials. 

Lemma 4.1. For every e > 0, t = \6£ 2 /s 2 , 6 < s 4 /64£ 2 , *H pairwise independent, the following 
holds for G = G<H, Gh ,F ■ {0, l} r -> {0, 1}". For every degree t polynomial P : R" -» R with \\P\\ = 1, 
and if/ : R — » R a smooth function with bounded \\i{/\\Lip, 



In comparison, a similar result in [MZ] gets an error bound of 2 0{e) ■ s. Also note that the above 
result requires no assumptions about P and in particular P need not be regular as required in [MZ]. 

Proof. Let X e u {1,-1}" and let Y = YQi,Z l , . . .,Z',F) be the output of the generator, where 
h e u H and Z denotes the samples generated from G/, to set the variables in bucket i, and F the 
output of our 2^-independent family. For brevity, let Z = (Z 1 , . . . , Z'). 

Fix a hash function h : [n] — > [t], and call a subset / c [n] h-bad if max 7 - eW |/ n /i _1 0')l > 1- Let 



with Yii a ] = 1- Let Ph '■ R" —* R be the degree J polynomial obtained by deleting all h-bad 
monomials in P: P h = Z/ : /notA-bad a / O/e/ We first show that the randomness in Z is enough to 
fool the polynomial iV 

Claim 4.2. For any fixed hash function h, 



Proof. We begin by showing that for any fixed values of h and F, and for any s e R that: 



We will prove this claim by a hybrid argument. The main intuition is that P h is affine in 
the variables of a single bucket and so should be fooled by the PRG for half spaces G/,. Fix a hash 
function h and let random variable T ' = (X 1 , . . . , X\ Z' +1 , . . . , Z') for < i < t, where X> e u { 1 , - 1 } m 
and are independent of one another. Note that Y° = Y and Y* = X e„ {1,-1}". Further, Y l ~ l , Y' 
differ only in the fth bucket which is X' in Y 1 ' 1 and Z in Y i . 

Fix i e [t] and let Z denote the variables not in the fth block. Without loss of generality suppose 
that B l = {j : h{j) = i} = [m]. Then, as a function of the variables in B l , Ph is affine: 



P h (x u x m , Z, F) = xy P X (Z, F) + x 2 P 2 (Z, F) A h x m P m (Z, F) + P (Z, F) := e z , F {x), 



E ty(P(x))\ - ^ E i)( [W(G(y)))] < \M\up ■ e. 




iel 



E[if,(P h (X))] - E [HPk(Y))] < W\\up 



■ yftd. 
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where Pi, ... , P m , P are polynomials in Z and F of degree at most I + 1. 
By assumption on the generator G h , we have that: 



Thus for each i, 



|p(p,(r')< s )-p(p,(r' +1 )<5)|<5. 

Iterating the above t times yields 



p (P h (X) < S ) - p (/yy) < 5) 

X~„{1,-1}" y~G-H,G h ,F 



Since F is 2^-wise independent, so is G<H,c h ,F- Thus, by Chebychev's inequality, we have that 
for any t 

p (\P h (X)\>s) p (|p ft (y)| > j) < i 

Therefore, for a fixed /z, by partial integration we have (we assume ^ is bounded), 

I Z^ 00 

|E[^(P ft (X))] -B[tff(P h (Y))]\ = i//(t)-(P[P h (X) > s]-P[P h (Y) > s]) ds 

I \J — CO 

< Mi* • IP [P h (X) >s]-P [P h (Y) >s]\ds 

\J — CO 

Xco 
min(^, s' 2 ) ds 
CO 

tfrff + 2||iAl| L; > ^ 2 rfs 

Jl/y/i6 



\Lip 



We now show that for a random h ~ ^H, P and Ph are close. 
Claim 4.3. |E W [<KP(7))] - E w ty(P h (Y))]\ < UWup ■ HI VO- 
Proof. Observe that for any y e R n , 

IW!/)) - HPh(y))\ < \m\u P ■ \P(y) - Ph(y)l 

Now, for any I c [n], P/j[7 is /z-bad] < I 1 It. Therefore, for any fixed hash function h, as Y is 
2^-wise independent, we get 

|E [HP(Y))] - E mP h (Y))]\ < ||<AIIl,> • EE [\P(Y) - P h (Y)\] 

h Z 



1 /2 / 

^•(ee[|P(F)-P,(7)| 2 ]) =U\\ Lip - E £ a] 



I: I is /i-bad 



1/2 



J] a 2 • P[7 is /z-bad] 



1 1/2 



U:|/l<d 



< UWup ■ -=. 
it 



□ 
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Note that the above can also be applied to X e„ {1,-1}". Therefore, combining the above two 
claims, we get 



E[^(P(Z))]-E[^(P(y))] 



E MPQO)] - E [ifr(P h (X))] 



X.h 



X,h 



+ 



B[^(P h (X))]-B[^P h (Y))] 



+ 



E[M)]-E[^(y))] 



Y,h 



YJi 



2U\\up-^l<h + A\\i}f\\ Up - <T5. 



The lemma now follows by setting t = \6£ 2 /s 2 , 6 = s A /64£ 2 . 



□ 



Theorem 1.2 follows from the lemma and the PRG for halfspaces of Meka and Zuckerman 
[MZ]. 

Proof of Theorem 1.2. Using the PRG for halfspaces of Theorem 3.4 as G\ u the generator G-h,G;„f 
as in Lemma 4.1 has a seed-length of r = 0(t ■ r h ) = 0((log n + \og 2 (£/s))£ 2 /s 2 ). □ 



5 Majority is Stablest over Reed-Muller Codes 

As mentioned in the introduction, our integrality gap instance for uniform sparsest cut builds on the 
short code gadgets of Barak et al. [BGH + ]. To do so, we first use our PRG for Lipschitz functions of 
polynomials to obtain a significant quantitative strengthening of the majority is stablest over Reed- 
Muller codes result of Barak et al. which in turn was based on the influential majority is stablest 
result of Mossel et al. [MOO]. We refer the reader to [MOO] for the motivation and history behind 
such results. The high-level structure of our majority is stablest result and its analysis are similar to 
those of Barak et al. who worked with the PRG for PTFs of Meka and Zuckerman [MZ] . However, 
the actual argument is somewhat delicate and we need to rework some technical arguments of 
Barak et al. We defer the details to the appendix. 

To state our result we first define the notion of influence to functions defined over the Reed- 
Muller code. For a function / : RM(n, d) — > R and a 6 F^ /RM(/z - d - 1) define the Fourier 
coefficient f (a) = B x ^ HMM [f(x) -Xa(x)]. 

Definition 5.1. For a function f : RM(n, d) — > R and i 6 [N], where N = 2 n and i > 0, the 

£ -degree influence of coordinate i in f is defined by 

Mf(f) = J] K a)1 - 

ae¥^IRU{n,n-d-\), 
dsg(a)^(, «, = 1 

We can now state our majority is stablest over Reed-Muller codes result. For /u e [0, 1], let 
t(ju) e R be such that Px-jvou)^ < t(ji)] = /u. Then, for p > define the Gaussian stability curve 
Y p : R -> R by T p (pi) = F X j[X < tip), Y < t(p)], where (X, Y) e R 2 is a two-dimensional mean 

zero random Gaussian vector with covariance matrix 
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Theorem 5.2. There exist universal constants c, C such that the following holds. Fix n,d > and 
e > 0. Let G = G(n,d,s) be the short code graph as in Lemma 3.8. Let f: RM(n, d) — > [0, 1] 
be a function on RM(n, d) with V, x ~HM(n,d)[f( x )] - f 1 an d max i£m Inf^ og (/) < r. Then, for 
d>Cloglog(l/r), 

clog log(l/r) 
x-rmm^ ""'~ J ' (1 -p)log(l/r) : 

where p = e~ E and Y p : R^Ris the noise stability curve of Gaussian space. 



e jf(x)Gf(x)] < r» + - \ *\ f [ ;_ s , (5.i) 



Qualitatively this is similar to the statement of Barak et al. [BGH + ]. However, quantitatively 
the above result is exponentially stronger in the requirement on the degree d of the Reed-Muller 
code. Barak et al. require d = Q(log(l/r)), whereas we only require d = Q(loglog(l/r)). This 
improvement is critical for our improved integrality gap instances and could be of use in other 
applications of the short-code. We defer the proof to the appendix. 



6 Integrality Gap Instances for Uniform Sparsest Cut 

As mentioned in the introduction, roughly speaking, our gap instance is obtained by replacing the 
long code with the short code in the construction of Devanur et al. [DKSV]. We bound the integral 
value of the instance using Theorem 5.2 and bound the SDP value by applying the framework of 
Raghavendra and Steurer [RS2]. 

Fix a degree parameter d and s > to be chosen later and N = 2". Let G = G(n, d, s) denote 
the short code graph as in Lemma 3.8. Our candidate gap instance is obtained by folding G along 
orbits of the affine shift action. To this end, let H be a subgroup of permutations of [N] acting on 
F^ such that the following properties hold: 

- RM(n, d) is closed under the action of H: For every v 6 RM(n, d), and n e H, n(v) 6 RM(n, d) 
(n(v) is the string obtained by permuting v according to n). 

- H is transitive: For all a,b e [N], 3 n 6 H such that n(a) = b. 

- H acts as automorphisms on G: For all u,v e RM(n, d), and n G H, the weight of the edge 
{u, v} in G is the same as the weight of the edge {n(u), n(v)} in G. 

Given such a subgroup of permutations, define the H-folded graph Gr in the natural way by 
collapsing the orbits {tt(v) : n 6 H] into a single vertex for every v e RM(n, d). Specifically, for any 
adjacent u, v in G with an edge-weight of w G (u, v) we add an edge of weight w G (u, v) between the 
orbits {n{u) : n e H} and {n(v) : n e H} in G H - 

For Reed-Muller codes, a natural group of permutations H as above is given by affine shifts: for 
every a e FJJ, consider the permutation n on [TV] (recall that Af = 2" and we identify [N] with Fj) 
given by the operation x — » x + a. It is easy to check that H satisfies properties (1) and (2) above. 
Property (3) follows from Lemma 3.8. We will show that the graph Gh has a large integrality gap 
for the balanced separator problem- Theorem 6.10. 

In the following, we denote vertices of G H by bold letters v, which we interpret as correspond- 
ing to orbits {n(v) : n e H) for vertices v e G. Further, as G is regular, to sample from the 
stationary distribution of Gh it suffices to sample v ~ RM(/i, d) and look at the corresponding orbit 
{n{v) : n 6 G H }- We state this below for later use. 
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Fact 6.1. For Gh as defined above, the stationary distribution on Gh is the same as the distribution 
of orbits [n{v) : n e G H ), where v ~ RM(n, d). 



6.1 Bounding the Integral Value 

We first bound the value of the integral solution of balanced separator on G#. We do so by appeal- 
ing to Theorem 5.2. 

Lemma 6.2. There exists a constant C such that the following holds. For d > C log log N and 
s = C((log log AO/ log A0 2/3 and t = s2 d+l the following holds. For G = Gin, d, e) as in Lemma 3.8 
and H as defined above, every 1 /3-balanced cut in Gh cuts at least £l( -\/s) fraction of the edges. 

Proof. Let p be the stationary distribution on Gh- For b e (0, 1 /2), let /' : V(Gh) — > {0, 1} define a 
^-balanced cut on G H , i.e., p ({v : f'(v) = 1}) € [b, 1 — b\. Lift the function /' to all of RM(n, d) in 
the natural way as follows: define / : RM(n, d) — » {0, 1} by f{v) = f'(v). From Fact 6.1, it follows 
that / is ^-balanced: 

E [fm=f* 6 [1/3,2/3]. 

v~RM(n,d) 

Further, observe that the fractional weight of edges cut by /' in G H is exactly the fractional 
weight of edges cut by / in G. We next show that / has small influences in order to apply 
Theorem 5.2. 

Let D = 2 d . Recall that for I < D/2 and i e [N], by Definition 5.1, 



Inff \f) = Yj 



aeF~, \a\<t, ar;=l 

where /(or) = E^ RM(n>(i) [/(x) 

We will show that Inff (/) = Inff (/) for all i, j e [N]. Fix i± j e [N] and let n e H be such 
that 7r(i) = j (such a permutation exists as H is transitive). Then, 

/W«))= E \f(x)- X<a) (x)] 

x~RM(n,d) 

= E Jf(n(x))-x«aMx))] 

x~RM(n,d) 

= E [/(x) • y a (x)] {f is constant on orbits under H) 

x~RM(n,d) 



/(or). 



Therefore, 



Inff(/)= Ya f {a)1= Z M<*)? = Yj 

= 2 /OS) 2 = Inff (f). 
Further, note that for any function /, 
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Therefore, Inf f { (f) < t/N. 

Let I = logN, t = (log N)/N. Then, from the above equation Inf^ log(1/T) < r. Thus, by 
Theorem 5.2 applied to /, for d > Cloglog(l/r) and s small enough so that 1 - s < e~ E = p < 
I -ell, 



e jfwfw < W + 7 f™£^ < i - n ( Vi) + < i - oc V5). 

.v~RM(n,rf) (1 -p)log(l/r) elogN 

where the last-but-one inequality follows from known facts about Y p - see [MOO], [BGH + ], and 
the last inequality from setting the value of s as in the lemma. 

The above bound immediately translates to a bound on the fraction of edges crossing the cut 
defined by {x : f(x) = I}. In particular, / cuts at least a Q.(s/s) fraction of the edges in G. 
The theorem now follows as the conductance of the cut defined by /' in G# is the same as the 
conductance of the cut defined by / in G. □ 



6.2 Bounding the SDP Value 

We next bound the value of the natural SDP relaxation for balanced separator as given in Figure 3.3 
on the graph G H . Most of this section is devoted to proving the following. 

Lemma 6.3 (Main SDP value). There exists a constant C such that the following holds. For 
d > CloglogN and s = C((loglogiV)/ logA0 1/3 , R < (logA0 O(1) the following holds. For G, Has 
in Lemma 6.2, the R'th round of SA R -hierarchy relaxation defined in Figure 3.3 for the balanced 
separator problem has objective value at most 0(Rs log(l/e)). 

We prove the lemma by exhibiting a SA R -feasible solution with low objective value. We do 
so by first constructing a nice system of clouds as in Theorem 3.11 and then applying the theorem 
to get a SAtf-feasible set of vectors and distributions. We then use the properties of G H and the 
system of clouds we construct to bound the objective value. 

For v 6 RM(n, d) Q {I, -1} N , let B(v) c R^ 3 , be the (possibly multi-)set of vectors 

B(v) = ((n(v)/ : n e //J . (6.1) 

Let & = (B(v) : v e RM(n, d)) denote all the clouds. We first show that by throwing away few 
of the clouds B(v), we get a nice system of clouds in the sense of Theorem 3.11. 

To this end, for v e RM(n, d), call the cloud B(v) nearly-orthogonal if the following holds: 

max\(v,n(v))\<N 2/3 /2. (6.2) 

neH 

The following shows that a nearly-orthogonal cloud satisfies the near orthogonality property of 
Theorem 3.1 1. 

Claim 6.4. For any nearly orthogonal cloud B(v) e < B, max^^ 2ZueB(v)( w ' u ) 2 ^ 3/2. 

Proof. We can bound the max in the lemma in terms of the largest eigenvalue of the Gram matrix 
of the vectors B(v), M(v) e R NxN defined by M(v) ab = {n a (vf 3 ,n b (vf 3 )/N 3 for a,b e F%. Now, 
by (6.2), M(v) ah < l/8iV for a + b. Thus, the off-diagonal entries of M(v) are at most l/8iV and 
the diagonal entries are 1. Thus, by Gershgorin theorem, the largest eigenvalue of M(v) is at most 
1 + (N - l)/SN < 9/8. The claim now follows. □ 
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We next argue that most of the clods B(v) e & are nearly-orthogonal. This follows from a 
straightforward application of Lemma 3.5. 

Claim 6.5. For random v ~ PM(n,d), the cloud B(v) defined by (6.1) is nearly-orthogonal with 
probability at least 1 - N~ n(2 \ 

Proof. Fix any a e F£. Then, for v = (P(x)) xer n, 

.veF',' 

By Lemma 3.7, for d > 2, a ± 0, and P a random degree d polynomial, the random variables 
((-l) P(l)+P(A+c,) ) AeF ^ are £-wise independent for k = 2 d ~ l . Therefore, by Lemma 3.5, for t = N l/6 /2, 

P f|<i;,^(i;)>|>JV 2/3 /2l<** /2 /<*. (6.3) 

Thus, by a union bound, for v ~ RM(n, d) B(v) is nearly-orthogonal with probability at least 1 - 

We next show that the clouds B(v) satisfy the matching property. 
Claim 6.6. For u, v e RM(rc, d), B{u), B{v) satisfy the matching property. 

Proof. Let n, cr e H be such that \(n(u), cr(v))\ = max u > € B(u),u'eB(v) \(u', v')\. Then, for any p e H, 
(p(n(u)), p(cr(v))) = (n(u),cr(v)). Therefore, B(u),B(v) satisfy the matching property, by consider- 
ing the matching {(p(n(u)), p(cr(v))) : p e H} between B(u), B(v). □ 

It is clear that the clouds B(u) satisfy the integrality property. Let S' = {B(v) : v e 
RM(n, d), v is nearly-orthogonal as in (6.2)}. Then, from the above arguments, S' satisfies the hy- 
pothesis of Theorem 3.11. Let (v B : B e S') and (p, s over {1, -\} s : S Q &') be the SA R -feasible 
pair of vectors and distributions as guaranteed by Theorem 3.11 for an odd integer value t to be 
chosen later. Then, for 6 = R 2 exp(-Q(t / R)) as in the theorem: 

+ • (6.4) 

where (itg : B 6 S) is a set of unit vectors orthogonal to the vectors {u®' : u e UB}. We thus 
have vectors and distributions for all nearly-orthogonal clouds. We extend these to all clouds of 
S arbitrarily: fix a good cloud B and for every cloud B e S\ let v B = v Bq . As will be clear 
later, this will not effect any of our arguments quantitatively as most clouds are nearly-orthogonal 
by Claim 6.5. 

The above arguments give us a vector for every cloud B e S. As clouds B e S naturally 
correspond to orbits of the action of H on RM(n, d), we get a vector for every vertex of the graph 
G H . Similarly, the distributions (fj, s over {1, -1} S : S c £>) naturally extend to distributions on 
subsets of vertices of G H . We show that this collection of vectors and distributions give us a 
good feasible solution for the SDP in Figure 3.3 on Gh- In the following we shall freely translate 
between vertices of the graph G H and the clouds B e S - the meaning will be clear from context. 



v B = Vl - 6 ■ normal 



— V 
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Theorem 3.11 already shows that the above candidate solution satisfies the last four constraints 
of Figure 3.3. We next show that (v B ) satisfies the balance condition (first constraint in Figure 3.3). 
To do so we shall use the following technical claim which helps us relate the correlation between 
vectors Vb( u ),Vb(v) as defined by (6.1) to the correlation between u, v. We defer the proof to the 
appendix. 

Claim 6.7. Let B(u), B(v) be nearly-orthogonal clouds. Then, 



Claim 6.8 (Balance property). Let p be the stationary distribution on G H . Then, for 2 d > 3t, and 
vectors (v B ) BeS as defined by (6.4), E B , B -^ \\v B - v B ,\\ 2 IA > 1/2 - 0(6) - - N~ m . 

Proof. We start by observing that by Fact 6.1, the distribution of (B, B') for B, B' ~ p is the same 
as that of (B(u), B(v)) for u, v ~ RM(n, d). We then focus on near-orthogonal clouds B(u), B(v) as 
most clouds satisfy this property by Claim 6.5. For nearly-orthogonal clouds B(u),B(v), we use 
Claim 6.7 to bound the correlation between v B ( u ), v B{v) by looking at correlations between vectors 
u, v. Finally, we use the fact that u, v ~ RM(n, d) are 2 J -wise independence to show that they are 
almost orthogonal to one another. 

Let u, v ~ RM(n,d) and let S denote the event that u, v are nearly-orthogonal. By Claim 6.5, 
P[fi] > 1 - N' n(2 \ Further, by Lemma 3.7, u, v are 2 rf -wise independent as strings over {1,-1}. . 
Thus, for 3t < 2 d ~ x and t odd, for any fixed cr £ H, 




(6.5) 



Further, if in addition (u, v) > 3N/4, then 



(v B (u),v m ) > ««, v)/N) 



3t _ 2 -cm _ 0(< 5). 



(6.6) 



= E \{u', o-(v')f] = E Uu, o-(v)f] 



= P[fi] • E [<«, o-(v)) 3t | fi] + P[-.fi] E [<u, o-(v)) 3 ' | S] 
= P[fi] • E [(u, o-(v)) 3 ' | fi] + N'^ ■ N 3t . 



Therefore, E [<u, cr(v)) 3t \ fi] > -A^-" (2</) • N 3f . Combining this with (6.5), we get 

E [(v m , v m ) | fi] > -N~ n(2d) - N- n{,) - 0(6). 



Finally, as all of the vectors v B{u) are unit vectors, we get, 



E [(v m , v m )] > P[fi] • E [(v m , v m ) | fi] - P[-,fi] > -N- a(2d) - N- n(t) - 0(6). 



Therefore, 



E \\v B -v B ,\\ 2 = 2-2 E (v B ,v B ,)=2-2 E (v B(u) ,v m ) > 2-0(5) -N'^ -N~ [ 

B'~p B,B'~p u,v~RM(n,d) 



The claim now follows. 



□ 
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We next bound the objective value for the set of vectors (v B : B e S). Recall that there is a 
natural association between vertices of the graph G H and the clouds of S. In the following, for a 
cloud B, let B' ~ Gli B denote the cloud corresponding to a random neighbor in G H of the vertex 
associated with cloud B. 

Claim 6.9 (Objective value). Let p denote the stationary distribution on G#. Then, for s < 1/4 
and vectors (v B ) BeS as defined by (6.4), B B ~ P , b>~ c , h b \\v b - v B ,\\ 2 < 0(ts) + 0(6) + 2~ n{,) + N~ n(2d) . 

Proof. We first simplify our task by moving from the folded graph G# to the original graph G. 
From Fact 6.1, the distribution of (B, B') for B ~ p and B' ~ Gh B is the same as that of (B(u), B(v)), 
where u ~ RM(n, d) and v ~g u (i- e -> v is a random neighbor of u in G). Thus, to prove the claim, 
it suffices to bound E M „ RM(nid)j U „ GU \\v B{u) - v m \\ 2 . 

The high-level argument is now similar to that of the proof of Claim 6.8. We focus on near- 
orthogonal clouds B(u) as most clouds are nearly-orthogonal. We then argue that for nearly- 
orthogonal clouds B(u), B(v) with v ~ G u, (B(u), B(v)) is large as (u, v) is large. The last fact 
follows from the properties of the short code graph G. 

Let u ~ RM(n, d) and v ~g u. By, Lemma 3.8, (u, v) > 3N/4. Let & be the event that u, v are 
nearly-orthogonal. Then, by (6.6), 

E [{v m , v m ) 1 6] > E [« M , v)/N) 3t | fi] - 2~ m) - 0(8) 

> E [««, v)/N) 3t ] - 2 P[-.fi] - 2" n(,) - 0(5) 

> (\-sf t -N- n{2i) -2- m -0(6) 

>l-0(ts)-N- n{2d) -2- n{,) -0(6), 

where the last-but-one inequality follows from Lemma 3.8 and the power- mean inequality. There- 
fore, by Claim 6.5, 

B[(v B(u) ,v m )] > nSm[(v m ,v m )\S] - P[-.fi] > 1 - 0(ts) - 0(6) - 2~ m) - N^. 
Hence, 

E \\v B(u) - v m \\ 2 = 2 -2B(v m , v m ) = 0(ts) + 0(6) + 2~ a(t) + N~ a(2d) . 
The claim now follows. □ 

We are now ready to prove the main claim of this section. 

Proof of Lemma 6.3. The lemma follows from setting the parameters appropriately in Claims 

6.8, 6.9. Let us use the same notations as in Claims 6.8, 6.9. Recall that in Theorem 3.11 
6 = R 2 exp(-Q.(t/R)). Therefore, by choosing t = 0(Rlog(l/s)) sufficiently large, by Claim 

6.9, we get 

E \\v B - v B ,\\ 2 = 0(Rslog(l/s)). 

B~ P ,B'~g h B 

Similarly, for C a sufficiently large constant, 2 d > 3t. Thus, by Claim 6.8, we get 

E \\v B -v B ,\\ 2 /4 > 2b(\ -b). 

B,B'~p 

Therefore, the vectors (v B : B e S) form a feasible solution for the SDP in Figure 3.3. The lemma 
follows. □ 
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6.3 Gap Instance 

We are now ready to state our main integrality gap result for balanced separator problem. This 
follows immediately from Lemmas 6.2, 6.3. 

Theorem 6.10. For all n, there exists a graph G on M -vertices such that for all R, the integrality 
gap for the balanced-separator problem on G after R rounds of SA R -hierarchy is at least 

exp(fl( VloglogM)) IR. 

Proof. Consider the graph G H as in Lemmas 6.2, 6.3. Then, as d = Clog log N, the number of 
vertices M in G H is 

M < |RM(n, d)\ < exp(n d ) = exp ((log A0 rf ) = exp (exp ((log log AO 2 )) . 

Thus, log N = Q(exp( ^loglogM)). We can thus assume R < log N, as else the claim becomes 
trivial. 

Further, by Lemmas 6.2, 6.3, for s = Q(((loglog AO/ log A0 2/3 ), the integrality gap after R 
rounds of the SDP in Figure 3.3 on G H is at least 



1 



^e^loga/e) 
The theorem follows. 



Q 



( flog JV) 



1/3 \ 



^(loglogAO 2 



= Q. 


U\ogN) llA \ 


= n 


) 


{ R J 


V 



'exp(VloglogM/4) > 



R 



□ 



The main integrality gap results stated in the introduction follow from the above theorem and 
the well-known relation between balanced separator and uniform sparsest cut (see [DKSV] for 
instance). 

Proof of Theorem 1.3. Follows from fact that the Goemans-Linial SDP can be realized within a 
constant number of rounds of SA R -hierarchy and setting R = 0(1) in the above theorem. □ 

Proof of Corollary 1.4. Follows from the above theorem and the known equivalences between the 
integrality gaps for the Goemans-Linial SDP and distortion of embedding negative-type metrics 
into £\. See [Rab], [CKN] for instance. □ 



Acknowledgements. We thank Prasad Raghavendra and David Steurer for valuable discussions. 
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A Majority is Stablest over Reed-Muller Codes 

We now prove Theorem 5.2. To do so, we first use our PRG for Lipschitz functions to show an 
invariance principle for low-degree polynomials over the Reed-Muller code along the lines of the 
invariance principle for low-degree polynomials of Mossel et al. [MOO]. Specifically, we show a 
statement similar to Lemma 4.1 for the function £ : R — » R defined by £(jc) = (min(0, x, 1 - x)) 2 . 

Let G<fi,Gf, i G<n,Gh,F be as in the statement of Lemma 4.1. As shown by Barak et al. [BGH + ], 
we use the fact that Reed-Muller codes of degree D can be realized as instantiations of G<h,g,, 
for D > \og(kf) where G/, generates a &-wise independent distribution on {1,-1}"'. Further, if 
G<H,G h were already a 2^-wise independent family obtained by sampling uniformly from a vector 
subspace over F 2 , as is the case for Reed-Muller codes, we may take F = G<h,g a an d thus have 

G<H,G h ,F = G<H,G h - 
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Further, by [DGJ + ], fc-wise independent distributions fool halfspaces with error at most 
0{ -y/(log kjjk). Combining these observations with Lemma 4.1, we get the following invariance 
principle for the function g. 

Theorem A.l. There exists a constant C3 such that for every £, e > 0, and d > C3log(^/e) the 
following holds. For every degree I polynomial P : R" — > R with \\P\\ = 1, n = 2 m , 



E [<T(P(X))]- E [f(P(7))] 



< S. 



Proof. Let ij/(x) = £(x) - x 2 . It should be noted that \tff\u P = 2. Since the Reed-Muller code is 
2^-independent, we have that Exe B {i,-i}" [-^C^O 2 ] = ^Ye u HM(m,d) \P(Y) 2 ]. It thus suffices to show that 



E [ifr(P(X))]~ E [ifr(P(Y))] 



< e. 



Let k = 0(£ 2 /s 4 ) and t = 0(£ 2 /s 2 ) be as in Lemma 4.1 so that G<h,g,„f fools if/(P( )) with error 
at most \if/\u P ■ e/2 = s when G h generates a fc-wise independent distribution. 

Now, for d > log(fcf) = O (log(^/e)), F e u RM(m, J) can be seen as an instance of G^^ GluF . This 
completes our proof. □ 

A similar result is shown by Barak et al. [BGH + ], however the error guarantee they get has an 
exponential dependence on the degree t of the polynomial P. This improvement is crucial for our 
applications. 

The proof of Theorem 5.2 is similar to the one of Barak et al. [BGH + ]. Unfortunately, we 
cannot use their proof as is, but need to rework their somewhat technical argument for technical 
reasons. The high-level idea is to use the invariance principle from the above result to derandoimze 
the result of Mossel et al. [MOO] for the Boolean noisy cube. For p > 0, let T p denote the Boolean 
noisy graph with vertices corresponding to {1,-1}^ and two vertices x, y e {1,-1}^ have an edge 
of weight p dHix ^\\ - py~ d H(x,y) ^ w h ere j w ( ; ) denotes the Hamming distance. We use the following 
lemma that follows easily from [MOO]. Below, we use expectation inner products for functions, 
i.e., for real-valued functions /, g on a universe U, f, g : U — » R, (/, g) = 'E x ~u[f(x)g(x)]. 



Lemma A.2. Let f: {1,-1}^ — » R be such that E/ = /i, E/ 2 < 1 and E£ o / < 77. Suppose 

1 \ l0gl0g(l/T) 



Inff og(1/r) / < t for all i 6 [N]. Then, 



(f,T p f)<r p (p) + 0(n) + 



l-p) log(l/r) 



where T p is the Boolean noise graph with second largest eigenvalue p and Y p is the Gaussian noise 
stability curve. 

Proof of Theorem 5.2. Lets = 1/ log 2 (l/r), 6 = 1/ log 2 (l/r), t = log(l/r). Let J = Cloglog(l/r) 
for C sufficiently large to be chosen later so that (for C3 as in Theorem A.l) d > c 3 \og(£ls), £ < 
6 2 2 d+l . 

For a e F^ /C, let A a be the eigenvalues of G. Then, by Lemma 3.8, 

\A a - p k \ < 6, for k<£, \AJ < p m , for k > €. (A. 1) 
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Let y < 1/8 be a parameter to be chosen later. Let g = G r f and G' = G 1_2r . Then, the graph G' 
has the same eigenfunctions as G - Xa for a eF^/C with eigenvalues A' a = A^ 7 . From the above 
equation, it is easy to check that, for p' = p l ~ 2y , 

K - (P'f\ < yfS, for k < £, \X a \ < (p') e/2 , for k > £. (A.2) 

Now, decompose g = g <f + g >e into a low-degree part gr <# = 2aeF^, wt(aK£ an d a Wg^- 

degree part g >f = X^/c, A (<*,cw sKo^- Then , 



(f, Gf) = <£, G'<7) = G'^> + (g >c , G'g >l ) < G'g**) + n ■ 
Hence, using Equation (A.2) (and the crude bound // < 1), 



max 



A' 



(A3) 



In the remainder of this section we shall view RM(n, d) as a subset of {1,-1}^. Then, as g 
is [0, l]-valued on RM(n,d) and £ measures distance to bounded random variables, by Equation 
(A.l), 

E [C(<r®)\ < E [{g(z) - gftof] = E [(g-(z)) 2 ] = E [(^(z)) 2 ] < max (^) 2 < p* 
Hence, by Theorem A.l, (recall that £ = log(l/r), e = 1/ log 2 (l/r)) 



Now, as RM(n, d) is ^-wise independent (^ < 2 d+1 ), 

E m f^U)l = E L«(z)l = E :\g(z)] ± E f(^(z)) 3 



1/2 



x~{l -1} W J z~S L J Z ~S z~S 

Observe that g^ is a multilinear polynomial of degree at most £ and as the ^-degree influences of 
g are at most t, g^ is r-regular. Therefore, by Lemma A.2, 

G(loglog(l/r)) 



(g*, T p ,g^) = J] (pV (a) g(a) 2 < I>(p + ^) + 0( n ) + 



or;wt(a)<^ 



(l-p)log(l/r) 



(A.4) 



Since T p ,(p. + ^/rj) < T p >(jj.) + 2^/rj and T p (/u) < T p >(ji) + |p -p'|/(l -p) (cf. Lemma B.3, Corollary 
B.5 in [MOO]), it follows from (A.3), (A.4) that 



(f,Gf) = (g,G'g)<Y p (ju) + 



\P-P'\ 
1-p 



(1 -p)log(l/r) 



1-p 

0(j 

Lpressj 
Q(loglog(l/T)) 

(l-p)log(l/r) 



= w + Q(loglog(1/T)) + o { ylog(1/p) + + + c^ 2 

pW (l-p)log(l/r) 1 ' 

(Here we used the estimate |p - p'| = |p - p 1 " 27 ! = 0(y log(l/p)).) Thus, for y 
C log log(l/r)/ log(l/r) for sufficiently large C, the above expression simplifies to 

</,G/><r» + 



This completes the proof of Theorem 5.2. 



□ 
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B Missing Proofs 



Proof of Lemma 3.5. By Khintchine inequalities and the fc-wise independence of X,'s we get 
E[| Yii x i\ k ] < k k/2 N k/2 . The claim now follows by Markov's inequality. □ 

We now prove Claim 6.7 from Section 6. We use the following auxiliary claim. 

Claim B.l. For any nearly-orthogonal cloud B{u), || Z H < eB ( H )(w')®1l/ ViV > I - N~ n(t) . 

Proof. For any nearly-orthogonal cloud B(u), and U\ ± u 2 e B{u), by (6.2), |(m 1; w 2 )| < (iV _1/3 ) 3 = 
1/7Y. Therefore, 

2 

J («T > J] H (i/) lf- Z K»i>«2>r>iV-7Y 2 M 

u'eB(u) u'eB(u) u\+U2&B(u) 

The claim now follows. □ 
Proof of Claim 6.7. Fix two nearly-orthogonal clouds B(u), B(v) e S. By (6.1), 



VeB(w) v'eB(v) I \tteH creH ' 

= |f = |f Z <«•-<»» 

» o-eif ' o-€i? 

Now, by Claim B.l and the above equations we get, 



^normal 



2 (u')A, normal! £ (,'H) = (l ±AT QW )( £ (u'f\ £ ^ 



VeB(«) / 



H'eB(i() 

-(i±«^») 4 Z <»•«*)> 



3f 



(B.l) 



Finally, note that in (6.4) the vectors are orthogonal to all of the cloud vectors. Then, by 
(6.4) and (B.l) 

(v B{u) , v m ) >(l-6)(l- N- n(t) ) ^ (u, cr(v)) 3t - 5. 



Equation (6.5) now follows. 

Equation (6.6) follows from a similar claim in [RS2]. We omit the proof here. 



□ 
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